A New Approach for Clustering Gene Expression Profiles

نویسندگان

  • TENG Li
  • LI Hong-yu
  • I-fan Shen
چکیده

Microarray experiments generate a considerable amount of data. Analyzing those data properly would help us gain a huge amount of biologically relevant information about the global cellular behavior. Clustering is one of the first steps in data analysis of high-throughput expression measurements. Many clustering algorithms have proved useful to make sense of such data. These algorithms, though useful, suffer from several drawbacks. Here, we propose an iterative two-step clustering algorithm which tackles some of these drawbacks. In the first step, a new graph-theoretic approach is introduced to locate clusters. In the second step, the radius(or size) of each cluster is identified adaptively. Our method doesn’t need to predefine the cluster number or cut the tree structure as K-means or hierarchical clustering does. The algorithm is successfully validated using existing data sets and can outperform hierarchical and K-means clustering in some aspects.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Modification of the Fast Global K-means Using a Fuzzy Relation with Application in Microarray Data Analysis

Recognizing genes with distinctive expression levels can help in prevention, diagnosis and treatment of the diseases at the genomic level. In this paper, fast Global k-means (fast GKM) is developed for clustering the gene expression datasets. Fast GKM is a significant improvement of the k-means clustering method. It is an incremental clustering method which starts with one cluster. Iteratively ...

متن کامل

خوشه‌بندی داده‌های بیان‌ژنی توسط عدم تشابه جنگل تصادفی

Background: The clustering of gene expression data plays an important role in the diagnosis and treatment of cancer. These kinds of data are typically involve in a large number of variables (genes), in comparison with number of samples (patients). Many clustering methods have been built based on the dissimilarity among observations that are calculated by a distance function. As increa...

متن کامل

بازشناسی جلوه‌های هیجانی با استفاده از تحلیل تفکیک پذیری مبتنی بر خوشه بندی چهره

Improvement of Facial expression recognition is aim of proposed method. This is a new formulation to the linear discriminant analysis. In the new formulation within-class and between-class covariance matrix are estimated on the each cluster and in the test phase new samples are mapped to the subspace that is related to the cluster of them. At the first we addressed clustering analysis of faces ...

متن کامل

Mesenchymal Stem/Stromal-Like Cells from Diploid and Triploid Human Embryonic Stem Cells Display Different Gene Expression Profiles

Background: Human ESCs-MSCs open a new insight into future cell therapy applications, due to their unique characteristics, including immunomodulatory features, proliferation, and differentiation. Methods: Herein, hESCs-MSCs were characterized by IF technique with CD105 and FIBRONECTIN as markers and FIBRONECTIN, VIMENTIN, CD10, CD105, and CD14 genes using RT-PCR technique. FACS was performed fo...

متن کامل

به کارگیری روش‌های خوشه‌بندی در ریزآرایه DNA

Background: Microarray DNA technology has paved the way for investigators to expressed thousands of genes in a short time. Analysis of this big amount of raw data includes normalization, clustering and classification. The present study surveys the application of clustering technique in microarray DNA analysis. Materials and methods: We analyzed data of Van’t Veer et al study dealing with BRCA1...

متن کامل

Multivariate Feature Extraction for Prediction of Future Gene Expression Profile

Introduction: The features of a cell can be extracted from its gene expression profile. If the gene expression profiles of future descendant cells are predicted, the features of the future cells are also predicted. The objective of this study was to design an artificial neural network to predict gene expression profiles of descendant cells that will be generated by division/differentiation of h...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997